Lsa Based Extraction of Semantic Similarity for Polish
نویسنده
چکیده
A recently started project on building the Polish WordNet is the background to the presented work. Here, an initial approach to automatic extraction of semantic similarity relation between Polish lexemes is discussed. The approach follows directly the main lines of Latent Semantic Analysis, but it is modified in order to cope with the rich inflection of Polish and the lack of a corpus comparable with the one used in LSA. The achieved preliminary results are discussed and possible lines of further development are identified. The similarity relation is intended to be a part of a combined algorithm of extraction of wordnet relations.
منابع مشابه
Using LSA and Noun Coordination Information to Improve the Precision and Recall of Automatic Hyponymy Extraction
In this paper we demonstrate methods of improving both the recall and the precision of automatic methods for extraction of hyponymy (IS A) relations from free text. By applying latent semantic analysis (LSA) to filter extracted hyponymy relations we reduce the rate of error of our initial pattern-based hyponymy extraction by 30%, achieving precision of 58%. Applying a graph-based model of noun-...
متن کاملUsing LSA and Noun Coordination Information to Improve the Recall and Precision of Automatic Hyponymy Extraction
In this paper we demonstrate methods of improving both the recall and the precision of automatic methods for extraction of hyponymy (IS A) relations from free text. By applying latent semantic analysis (LSA) to filter extracted hyponymy relations we reduce the rate of error of our initial pattern-based hyponymy extraction by 30%, achieving precision of 58%. Applying a graph-based model of noun-...
متن کاملLatent Semantic Analysis based Language Models for Meetings
Language models that combine N -gram models with Latent Semantic Analysis (LSA) based models have been successfully applied for conversational speech recognition [3] and for the Wall Street Journal recognition task [1]. LSA defines a semantic similarity space using a training corpus. This semantic similarity can be used for dealing with long distance dependencies, which are an inherent problem ...
متن کاملUsing Latent Semantic Analysis to Estimate Similarity
In three studies we investigated whether LSA cosine values estimate human similarity ratings of word pairs. In study 1 we found that LSA can distinguish between highly similar and dissimilar matches to a target word, but that it does not reliably distinguish between highly similar and less similar matches. In study 2 we showed that, using an expanded item set, the correlation between LSA rating...
متن کاملCombination of latent semantic analysis based language models for meeting recognition
Latent Semantic Analysis (LSA) defines a semantic similarity space using a training corpus. This semantic similarity can be used for dealing with long distance dependencies, which are an inherent problem for traditional wordbased n-gram models. Since LSA models adapt dynamically to topics, and meetings have clear topics, we conjecture that these models can improve speech recognition accuracy on...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2006